FOUR PERSPECTIVES ON SECONDARY TERMS IN THE 
DAVENPORT-HEILBRONN THEOREMS 



FRANK THORNE 

Abstract. This paper is an expanded version of the author's lecture at the Integers Confer- 
ence 2011. We discuss the secondary terms in the Davenport-Heilbronn theorems on cubic fields 
and 3-torsion in class groups of quadratic fields. Such secondary terms had been conjectured 
by Datskovsky- Wright and Roberts, and proofs of these or closely related secondary terms were 
obtained independently by Bhargava, Shankar, and Tsimerman [7], Hough [2^, Zhao [H], and 
Taniguchi and the author [36j . 

In this paper we discuss the history of the problem and highlight the diverse methods used in 
[ZlllSllllISn] to address it. 



1. Introduction 
This paper concerns the fohowing two theorems. 
Theorem 1.1. Let N^{X) count the number of cubic fields K with < ibDisc(Er) < X. We have 

(1.1) N^{X) = C^-^X + ^ ^(^5/6) 

^ ' ' 12C(3) 5r(2/3)3C(5/3) ^ 

where = 3, C+ = 1, K' = ^3, and K+ = 1. 

Theorem 1.2. For any quadratic field with discriminant D, let Cl3{D) denote the 3-torsion sub- 
group of the ideal class group Cl{Q{^/D)), we have 

(-) „ #c>3(z„ . . n (- ^) ^ 

and where the sum ranges over fundamental discriminants D, the product is over all primes, and 
the constants are as before. 

The main terms are due to Davenport and Heilbronn [TS], and the secondary term in Theorem 
11.11 was conjectured by Roberts [3T] and imphcitly by Datskovsky and Wright [T3]. 

The secondary terms were expected to be difficult to prove. However, progress has recently made 
in four independent works, using a variety of methods. Both results above have been proved by 
Bhargava, Shankar, and Tsimerman [7], using the geometry of numbers, and by Taniguchi and the 
present author [36j, using Shintani zeta functions. Hough ^3] has proved a variation of Theorem 
11.21 by studying the distribution of Heegner points in the upper complex plane, and Zhao [44] has 
obtained a variation of Theorem II .11 for function fields, using algebraic geometry^ 

The author's lecture at the 2011 Integers Conference explained (briefly) how each of these four 
approaches all shed light on these secondary terms, and this paper is an expanded version of 



^A proof of Theorem 1 1 . 2 1 does not appear in [7], but the authors have shown me a proof. The error terms in [23] 
and [13] are larger than the secondary terms at present, but both of these approaches naturally explain the secondary 
term, and both authors are currently working on refining their methods. 
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our lecture. We begin with some background on counting fields and on counting cubic fields in 
particular. The reader might skip to Section [4] for our discussion of the secondary terms. 

2. Counting Fields in General 

How does one count number fields? This question has been addressed in many excellent exposi- 
tory accounts, such as that by Cohen, Diaz y Diaz, and Olivier |10j . or Chapter 6 of Bhargava's ICM 
proceedings article [5]. Therefore, our overview will be brief. To a large extent we concentrate on 
the "trivial" problem of counting quadratic fields, as some observations on this problem anticipate 
the methods developed to count cubic fields with good error terms. 

The two most important theorems in the subject were proved by Minkowski and Hermite: 

Theorem 2.1 (Hermite). There are only finitely many number fields of bounded discriminant. 
Theorem 2.2 (Minkowski). The discriminant of a number field K of degree n satisfies 

(2.1) |Disc(i^)|>(^)'(j)". 

If we write n = r + 2s, where r is the number of real embeddings of K, and s is the number of 
pairs of complex embeddings, we may replace (7r/4)"' with (tt/A)'^^. Moreover, by Stirling's formula 
n! ~ \/2vrn(^) and therefore 

(2.2) |Disc(i^)| > (e^ • ^ + o(l))" = ^5.803 • • • + o(l)) 

The constant can be improved; see Odlyzko [28]. Conversely, this bound is sharp apart from 
the constant; Golod and Shafarevich [21] proved the existence of an infinite class field tower of 
fields K for which |Disc(A')J^/'^ is constant0 

To prove these theoremqj, use the r embeddings X M and the s pairs of embeddings K ^ C 
to embed the ring of integers of K as a, lattice in n-dimensional space. As proved by Minkowski, any 
convex body centered around the origin, whose volume is greater than 2"|Disc(i<')|, must contain 
a lattice point other than zero. However, this point must have norm at least 1, and geometric 
considerations allow one to conclude Minkowski's lower bound. 

Hermite's theorem can be proved in a similar way. We again use Minkowski's convex body 
theorem to find an element a £ Ok, the sizes of whose complex embeddings are all small, and which 
is guaranteed to generate K over Q. The minimal polynomial of a is equal to Ilcrl"'^ ~ '^('^))' where 
a ranges over all real and complex embeddings of K, and the bounds obtained from Minkowski's 
theorem yield bounds on the coefficients of the minimal polynomial of a. For a fixed discriminant 
bound, there are only finitely many such polynomials, and thus only finitely many possibilities for 
K. 

These classical theorems suggest that it is natural to count fields by degree. In what follows, let 
Nn{X) be the number of fields K of degree n with |Disc(i^)| < X. (Results are also known when 
the sign of the discriminant, or more generally the number of real embeddings, is specified.) 

We have the following results (excluding n = 3): 

n = 1. There is only Q. 

^Martinet [5S] gave the example oi F — Q(Cii + CiVi 46), which he proved to have an infinite 2-class field 
tower, for which |Disc(_ft')|^''" = 92.2- • ■ for each field K. We refer to Lemmermeyer [24] for a discussion of related 
issues, along with a very thorough bibliography. 

^See [27], Ch. III. 2 for complete proofs. 
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n = 2. We have the equahty of Dirichlet series 



(2.3) 



E |Disc(i^)| 



s 



1 + 1 - 2"' + 2 • 4 



s 



) 



C(^) 



[K:Q]=2 



a2s) 



and it follows that 



(2.4) 



N2{X) 



-^X + 0{VX). 



The typical proof of (j2.4p estimates, for each squarefree odd the number of integers n divisible 
by satisfying the 2-adic conditions implied by (j2.4|) , and then uses inclusion-exclusion to sieve for 
squarefreeness. The details are commonly left as an exercise for beginning analytic number theory 
students. However, it was not until the recent work of Belabas, Bhargava, and Pomerance [2J that 
the analogous method was developed to yield power-saving error terms for N'i,{X\ 

How does one estimate the number of integers n with < n < X and q^\nl. (For simplicity of 
explanation, we ignore the 2-adic conditions.) One way is to interpret it as a lattice-point counting 
problem in a one-dimensional vector space, and estimate the number of such n by ^ times the 
length of the interval (0, X'). This is of course trivial, but its generalization to the cubic case, where 
one counts GL2(Z)-orbits on a four-dimensional lattice, satisfying local conditions (mod g^) and a 
global condition < ibDisc(/) < X, is not. (The higher dimensional analogues are still less so!) 

These integers may also be counted using analytic number theory. By Perron's formula, we have 



and these sums may be estimated by shifting the contour and using the functional equation of the 
zeta function. This functional equation relies on Poisson summation, and so again relies on the 
question's interpretation as a lattice point counting problem. 

Although even the most diehard fan of contour integration would likely prefer the trivial proof, 
this analytic method also generalizes usefully to the cubic case, where the Riemann zeta function 
is replaced with a Shintani zeta function. 

These two counting techniques, in combination with the inclusion-exclusion sieve, are the starting 
points of the geometric and analytic proofs of Theorems 11.11 and II. 2^ respectively! 

We note one additional point related to (12. 3p . Wright [H] observed that this Dirichlet series has 
the beautiful representation 



and proved this in a much more general framework. (He obtained similar formulas for degree n 
cyclic extensions of any number field.) His results follow from considering a nontrivial 'twist' of the 
adelic zeta function of Tate's thesis [38]. This twist makes the affine line into a prehomogeneous 
vector space, with the action (j) of GL(1) given by (/){t)x = t^x. 

Essentially, a vector space is prehomogeneous if it has an action of an algebraic group G, which 
is transitive over C apart from the vanishing locus of an irreducible polynomial. This "prehomoge- 
neous" property is essential both in Bhargava's work and in the zeta function approach pioneered 



(2.5) 




(2.6) 
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by Sato and Shintani [33j; cubic, quartic, and quintic fields are parameterized by lattice pointqj up 
to the action of G(Z), and these may be counted geometrically or analytically. 

Wright's work on (12. and its generalizations mirrors his work with Datskovsky [401 [T3l [14] 
on the Shintani zeta function associated to cubic fields, and this latter work is at the heart of 
the analytic approach to counting cubic fields. In followup work, Wright and Yukie [42] proposed 
a program to enumerate quartic and quintic fields by studying the zeta functions associated to 
appropriate prehomogeneous vector spaces. So far their program has not succeeded; however: 

n = 4, 5. Bhargava ^ i6j proved the asymptotic formulas 

(2.7) N,{X, S,)^^l{{l+p'^- _ . X, 

V 

1 S 

(2.8) 7V5(X) ^ _ (1 + _ _ . X 

V 

The count N/^{X^ S4) includes only quartic fields with Galois group 54, and ()2.7p implies an asymp- 
totic for N4{X) in combination with work of Baily [T], Cohen, Diaz y Diaz, and Olivier [9], and 
Wong [39]. Bhargava applies the geometric approach alluded to in the A^2(^) case (and described 
later for Ns{X)), and his work may be considered an extension of the methods described in Sections 
[3] and [61 

n > 5. For n > 5, Bhargava conjectured [1] that Nn{X,Sn) ~ C„X. The explicit constants C„ 
have natural interpretations as Euler products; each Euler factor represents the 'probability' that 
a field K of degree n has a certain localization. Ellenberg and Venkatesh |19] have proved the best 
upper bound to date; namely iV„(X) <C X^^p^^^v^^). 

It is unclear whether prehomogeneous vector spaces may be used to obtain formulas for Nn{X) 
for n > 5. By a classification theorem of Sato and Kimura [32j, it is known that there are no 
reduced, irreducible, and reductive prehomogeneous vector spaces parameterizing rings of rank 
> 5; it is not yet known whether there are any prehomogeneous vector spaces not satisfying these 
assumptions which parameterize such rings. 

2.1. Counting torsion elements in class groups. The problem of counting 3-torsion elements 
in class groups of quadratic fields is related to that of counting cubic fields. If D 7^ 1 is a fundamental 
discriminant, then class field theory provides a bijection between subgroups of Cl(Q(\/l))) of index 
3, and unramified cyclic cubic extensions of Q(\/^). Such extensions, in turn, correspond to (non- 
Galois) cubic extensions of Q whose discriminant is D. Therefore, counting 3-torsion elements in 
quadratic fields is equivalent to counting cubic fields whose discriminant is fundamental, which are 
those not totally ramified at any prime. 

Therefore, this problem may be treated as a variation of the problem of counting cubic fields. 
This is the approach of Sections [Sj [5l and[6l however, there are other approaches as well, such as 
that described in Section [Tj A general heuristic for statistics of torsion elements in class groups 
was proposed by Cohen and Lenstra [TT]; their heuristic has inspired a great deal of interesting 
followup work, but for the most part the Cohen-Lenstra heuristics remain wide open. 

3. Davenport-Heilbronn, Delone-Faddeev, and the main terms 

In this section we discuss the proofs of the main terms in Theorems 11.11 and II. 2i To summarize, 
the idea is as follows: We count cubic fields by counting their maximal orders, and so we count all 
cubic orders and then sieve for maximality. We count cubic orders by counting cubic rings and then 



'Not all of the G(Z)-orbits correspond to fields, as we will see in the cubic case. 
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subtracting the contribution of the reducible rings, and these correspond to binary cubic forms. 
(The space of binary cubic forms is prehomogeneous in the sense described previously.) Thus at 
last we have a geometric lattice-point counting problem, which can be solved by "brute force", 
so to speak. This interpretation in terms of lattice points also opens the door to the use of zeta 
functions. 

We recommend the paper of Bhargava, Shankar, and Tsimerman [7J for the simplest self- 
contained account of the Davenport-Heilbronn theorems, with complete proofs. 

We begin by defining cubic rings and cubic forms. A cubic ring is a commutative ring which is 
free of rank 3 as a Z-module. The discriminant of a cubic ring is defined to be the determinant of 
the trace form {x,y) = Tr(xy), and the discriminant of the maximal order of a cubic field is equal 
to the discriminant of the field. 

The lattice of integral binary cubic forms is defined by 

(3.1) Vi ■■= {au^ + bu'^v + cuv"^ + dv^ : a,b,c,d e Z}, 
and the discriminant of such a form is given by the usual equation 

(3.2) Disc(/) = b'^c^ - 4ac^ - Ab^d - 27a^d^ + ISabcd. 
There is a natural action of GL2(Z) (and also of SL2(Z)) on Vz, given by 

(3.3) ^^.f)(^u,v) = ^f{{u,v).^). 

We call a cubic form / irreducible if f{u, v) is irreducible as a polynomial over Q, and nondegenerate 
if Disc(/) / 0. 

Cubic rings are related to cubic forms by the following correspondence of Delone and Faddeev, 
as further extended by Gan, Gross, and Savin pO] : 

Theorem 3.1 ( |16l I20j ). There is a natural, discriminant-preserving bisection between the set of 
GL2{1') -equivalence classes of integral binary cubic forms and the set of isomorphism classes of 
cubic rings. Furthermore, under this correspondence, irreducible cubic forms correspond to orders 
in cubic fields. 

Finally, if x £ Vz is a cubic form corresponding to a cubic ring R, we have StabGL2(Z)(^) — 
A\xt[R). 

It is therefore necessary to exclude reducible and nonmaximal rings. The nonmaximality condi- 
tion is the more difficult of the two to handle, and for this Davenport and Heilbronn established 
the following criterion: 

Proposition 3.2 ([15,^7]). Under the Delone- Faddeev correspondence, a cubic ring R is maximal 
if any only if its corresponding cubic form f belongs to the set Up C Vz for all p, defined by the 
following equivalent conditions: 

• The ring R is not contained in any other cubic ring with index divisible by p. 

• The cubic form f is not a multiple ofp, and there is no GL2{'Z) -transformation of f{u,v) = 
au^ + bu^v + cuv"^ + dv^ such that a is a multiple of p^ and b is a multiple of p. 

In particular, the condition Up only depends on the coordinates of / modulo p^. 

Therefore one can count cubic fields as follows. One obtains an asymptotic formula for the 
number of cubic rings of bounded discriminant by counting lattice points in fundamental domains 
for the action of GL2(Z), bounded by the constraint |Disc(x)| < X. The fundamental domains may 
be chosen so that almost all reducible rings correspond to forms with a = 0, and so these may be 
excluded from the count. 
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One then multiplies this asymptotic by the product of all the local densities of the sets Up. This 
gives a heuristic argument for the main term in (II. ip . and one incorporates a sieve to obtain a 
proof. 

The main term of Theorem 1 1 . 2 1 may be proved similarly. By class field theory, there is a bijection 
between subgroups of Cl3(Q(\/^)) of index 3, and cubic fields of discriminant D, which are precisely 
those which are not totally ramified at any prime. This ramification condition may also be detected 
by reducing cubic forms modulo p^, and thus may be incorporated into Proposition 13. 2[ The 
remainder of the proof is the same. 

3.1. The work of Belabas, Bhargava, and Pomerance. In [2], Belabas, Bhargava, and Pomer- 
ance (BBP) introduced improvements to Davenport and Heilbronn's method, and obtained an error 
term of 0{X'^^^^'') in and ()1.2p . They began by observing that 

(3.4) iV3±(X) = ^/i(g)iV±(Q,X), 

where N^{q,X) counts the number of cubic orders of discriminant < ibD < X which are 
nonmaximal at every prime dividing q. This simple observation has proved to be quite useful! For 
large q, BBP prove that N^{q,X) < X3'^(9)/g2 using reasonably elementary methods. Therefore, 
one has 

(3.5) N^iX) = Y,KQ)N^iq,X) + 0{X/Q'-^). 

q<Q 

For small q, BBP estimate N^{q,X) with explicit error terms using geometric methods. These 
error terms are good enough to allow them to take the sum in (j3.4p up to {X log X)^/^, which yields 
a final error term of 0(X^/8+'^). 

In addition, their methods extend to counting S^-quartic fields, where they obtain a main term 
of C^X with error < x23/24+e, 

4. The "Four Approaches" 

This, then, brings us to Theorems ll.ll and ll.2[ Theorem 11.11 was conjectured in print by Roberts 
|31j . and implicitly in the foundational work of Datskovsky and Wright [401 [T3l I14| . Roberts gave 
substantial and convincing numerical evidence for his conjecture, as well as the heuristic argument 
described in Section [5l 

In this paper we will discuss four very different approaches, all independent, to these secondary 
terms. 

Shintani zeta functions (Taniguchi-T. \36\ I37j). Taniguchi and the author further devel- 
oped the theory of Shintani zeta functions, and proved Theorems 11.11 and 11.21 with error terms of 
(9(X^/^+'^) and 0{X^^^'^^~^''), respectively. These proofs closely follow the heuristic arguments of 
Roberts and Datskovsky- Wright. 

The Shintani zeta function approach is quite flexible, and p6] contains several generalizations of 
the main theorems. Perhaps the most interesting of these is a generalization to counting cubic field 
discriminants (or 3-torsion elements of class groups) in arithmetic progressions, where a certain 
"non-equidistribution" phenomenon arises. For example, when counting cubic field discriminants 
= a (mod 7), the secondary term in Theorem 11.11 is different for every residue class a (mod 7). 

A refined geometric approach (Bhargava, Shankar, and Tsimerman [7j). Bhargava, Shankar, 
and Tsimerman gave the first proof of Theorem ll.il with an error term of 0{X^^/^^+^). Their proof 
follows Davenport and Heilbronn's original work, counting lattice points via geometric arguments. 
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with substantial simplifications and improvements. Indeed, these improvements are already evident 
in their proofs of the classical Davenport-Heilbronn theorems. 

Their proof of the secondary term is based on a "slicing" argument which we describe briefly in 
Section [6l Their treatment of the error terms introduces a certain correspondence for nonmaximal 
cubic rings; this correspondence has also proved useful in the context of Shintani zeta functions, 
and a combined approach ([8], in progress) has yielded an error term of 0{X^/^+'). 

Equidistribution of Heegner points (Hough [23]). Hough obtained a statement closely 
related to Theorem 11.21 His scope is limited to counting 3-torsion elements in imaginary quadratic 
fields Q(\/^) with D = 2 (mod 4); in this context, he proves a version of Theorem 11.21 but with 
an error term larger than X^^^ . In followup work (in preparation), he proves the secondary term 
for a smoothed variation of this sum. 

Hough derives this result as a consequence of an equidistribution theorem for the Heegner points 
associated to the 3-part of class groups of imaginary quadratic fields; his proof of the Davenport- 
Heilbronn theorem appears as a bonus, and without reference to binary cubic forms. His techniques 
also apply to the fc-parts of these class groups, for odd k > 5, where he obtains secondary terms (of 
order X^/^+i/fc). His error terms are currently larger than both the secondary and the main terms 
for k > 5, but this work sheds light on a notoriously difficult problem. 

Trigonal curves and the Maroni invariant (Zhao [44]). Zhao enumerates cubic extensions 
of the rational function field ¥q{t) using algebraic geometry. Such extensions are in bijection with 
isomorphism classes of smooth trigonal curves, that is, smooth 3-fold covers of P^, and these may be 
counted by embedding them in certain surfaces F^. He obtains a secondary term as a consequence 
of a bound for the integer k, called the Maroni invariant; for now his error terms are larger than 
this secondary term. 

In all of these approaches, the secondary term is easier to see than it is to prove. In particu- 
lar, each of these approaches yield the secondary term in a natural way, but it is not a priori 
evident that the error terms can be made smaller than X^^^l 

In what follows, we will describe, insofar as we can in a couple of pages, why each of these 
approaches naturally yields a secondary term. For the Shintani zeta function approach, we will say 
only a little about the error term, and for the other three approaches we will say nothing. Indeed, 
we will use the notation 0{- ■ ■) whenever the details are best left to the respective papers. 



5. The Shintani Zeta Function Approach 

Taniguchi and the author [36] proved Theorems 11.11 and 11.21 using the analytic theory of Shintani 
zeta functions. The Shintani zeta functions associated to the space of binary cubic forms are 
definecH by the Dirichlet series 

(5.1) C^{s):= |o, u ^ , |Disc(x)|-= ^ ^_^|Disc(i?)|-. 

^ \htabQhom\x)\ , , Aut(it) 

xGGL2(Z)\y± ' ^i^2W^ l\ ±Disc(i?)>0 ' ^ ^' 



In the former sum, V-^ is the lattice defined in (|3.ip . the sum is over elements of positive or negative 
discriminant respectively, and Stab(x) is the stabilizer of x in GL2(Z). The latter sum is over 
isomorphism classes of cubic rings. The equality of these two sums follows from the Delone-Faddeev 
correspondence (Theorem 13. ip . 



^The Shintani zeta functions are commonly defined in terms of SL2(Z) rather than GL2(Z), multiplying (|5.f p by 
a factor of 2. 
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This is an example of a zeta function associated to a prehomogeneous vector space; in this case 
the word 'prehomogeneous' reflects the fact that GL2(K) acts transitively on the positive- and 
negative-discriminant loci V^. Sato and Shintani [33] developed a general theory of zeta functions 
associated to prehomogeneous vector spaces, and for the space of binary cubic forms Shintani proved 
[34| that these zeta functions enjoy analytic continuation and an explicit functional equation. These 
zeta functions have poles at s = 1, and, anomalously, at s = 5/6. 

Shintani's work opens the door to the study of cubic fields using analytic number theory. In 
particular, by Perron's formula and standard techniques we have 
(5.2) 

±Disc(x)<X 

Although the left side is not the counting function of cubic fields, for the first time we see the X^l^ 
secondary term. This, therefore, gives an explanation for the secondary terms in Theorems 11.11 and 
II. 2t because the Shintani zeta functions have secondary poles@ 

To count cubic fields, the most important step is sieving for maximality. This is possible by 
work of Datskovsky and Wright |40 1 113 1 [H]. who gave Shintani's work an adelic formulation along 
the lines of Tate's thesis [38]. This allowed them to incorporate a variety of conditions into the 
definition of the Shintani zeta functions, including the Davenport-Heilbronn maximality conditions 
(given in Proposition 13. 2p . 

A bogus proof of Roberts' conjecture is as follows. For a set of primes define the V-maximal 
Shintani zeta function by the Dirichlet series (j5.ip , with the added condition that the only cubic 
forms counted are those in the set Up for each p £ V. Therefore, letting V be the set of primes 
< X, it follows that 
(5.3) 

/ 1 /•2+ioo v-s a 

E lA = / ^$is)—ds = ReSs=i^${s)X + -Res,=,/,C${s)X^^' + 0{X^/'^^), 

where the sum is over all maximal cubic ring^, or equivalently cubic fields, with ibDisc(-R) < X. 
Unfortunately the error term depends on the particular zeta function ^-p, and therefore the implied 
constant depends on X (indeed, when standard techniques are used, exponentially so!) 

However, this flawed argument enabled Datskovsky and Wright [Tl] to give a rigorous analytic 
proof of the Davenport-Heilbronn theorem. The key step@ is to use (|5.3p , but with V equal to the 
set of all primes < Y, where Y tends to infinity very slowly with X. Given their adelic framework, 
their work yielded an asymptotic formula for the number of cubic extensions of any global field (of 
characteristic not 2 or 3). 

Moreover, Roberts [31] computed the limit of the secondary terms in (j5.3p and found an excellent 
match with numerical data. This led him to conjecture that (15. 3p (equivalently, (II. ip ) was correct 
with some undetermined error term smaller than X^^^. 



^ Of course, this begs the question of why the Shintani zeta functions have poles at s = 5/6. This can of course 
be explained by the various calculations in Shintani's work, but for now a satisfying highbrow argument is unknown 
to the author. 

Reducible maximal cubic rings correspond to fields of degree < 2, which can be separately counted and therefore 
subtracted with good error terms. 

®We have adapted their argument somewhat to our point of view. Datskovsky and Wright also need, as did 
Davenport and Heilbronn, the bound (|3.5|) for the number of cubic rings nonmaximal at any prime > Y . 
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He closed his paper with the following (paraphrased very slightly): 

The pessimistic discussion in [U] suggests to us that the way may be difficult. 



However, one ingredient of a proof might be the functional equation of ^p(s) with 
respect to s — >• 1 — s, studied in 001 [13]. Another ingredient might be [[33], Thm. 
3], which concerns growth of arithmetic functions whose associated Dirichlet series 
satisfy such a functional equation. 

This is actually a good summary of our proofs of Theorems 11.11 and 11.21 To get the ball rolling, 
we needed to incorporate the sieve used by Belabas, Bhargava, and Pomerance described in Section 
13. 1[ In place of the P-maximal zeta function we introduced the q-nonmaximal zeta function, which 
counts only cubic rings not maximal at each prime dividing q. This allowed us to count maximal 
cubic rings, and the irreducible such rings correspond to cubic fields. 

Remark. We count 3-torsion elements of class groups by a variation of this argument. Such elements 
correspond to cubic fields not totally ramified at any prime, and so we expand the definition of the 
g-nonmaximal zeta function to count cubic rings either nonmaximal or totally ramified at every 
prime dividing q. 



The equation ()5.3p holds for each g-nonmaximal zeta function, and the g-dependence of the error 
term can be bounded in terms of the exponential sum 

1 



(5.4) 



2^6^. 2 



E 



^q{y)exp{27ri[x,y]/q 



q2z y^^Z/g^Z 



where ^q{x) is the characteristic function of the Davenport-Heilbronn g-nonmaximality condition, 
and [x,y] = x^yi — ^X3y2 + ^X2y3 — xiy^. Each inner sum appears naturally when we shift the 
contour in (j5.3p to the left of 5R(s) = and apply the functional equation. Trivially, the sum in 
(j5.4p is bounded by q^; this is enough to allow us to take Q = X^^"^^ in p.5p and obtain an error 
term of x'^'^/'^^^'' in the Davenport-Heilbronn theorems. 

In |36] we incorporated several improvements to lower the error terms in Theorems 11.11 and 11.21 
to 0{X'^ /^~^^) and 0(X^^/^^"'"^), respectively. Among these is a careful analysis of the sum in ()5.4p . 
carried out in [37j. For each x we obtained exact formulas for the inner sum in (15. 4p . proving that 
the outer sum is <C g^"'"'^. 

5.1. Non-equidistribution in arithmetic progressions. The Shintani zeta function approach 
can also be used to study the distribution of cubic field discriminants in arithmetic progressions. 
To do this, one twists the Shintani zeta functions by Dirichlet characters, obtaining L-functions 
which are also proved to enjoy analytic continuation and a functional equation. 

The details are complicated, so here we simply present representative numerical data. The tables 
below list the number of cubic fields K with < Y)isc{K) < 2 • 10^ and Disc(i^) = a (mod m), for 
m = 5 and m = 7, and each residue class a. 



Discriminant modulo 5 





1 


2 


3 


4 




Actual count 




21277 


22887 


22751 


22748 


22781 




Theoretical result 


21307 


22757 


22757 


22757 


22757 




Difference 




30 


130 


6 


9 


24 




Discriminant modulo 7 





1 


2 


3 


4 


5 


6 


Actual count 


15330 


17229 


14327 


15323 


17027 


18058 


15150 


Theoretical result 


15316 


17209 


14277 


15316 


17024 


18063 


15131 


Difference 


14 


20 


50 


7 


3 


5 


19 
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The data modulo 5 is essentially equidistributed, apart from the a = column. (The relative deficit 
of cubic fields divisible by 5, or by any other modulus, can be explained by a careful reading of 
Davenport and Heilbronn's original paper.) However, the data modulo 7 shows a striking lack of 
equidistribution0 

The theoretical results above come from a generalization of Theorem 11.11 to arithmetic progres- 
sions; the secondary term now involves a sum of residues of twisted Shintani zeta functions. These 
residues are evaluated in ^7}, and for characters x for which 7^ 1> the associated residue at 
s = 5/6 is zero. However, when = 1 this residue is often nonzero, and nonzero residues explain 
biases in arithmetic progressions such as the one seen above. 

6. A Refined Geometric Approach 

To give an overview of Bhargava, Shankar, and Tsimerman's proof [7], we start with their 
treatment of the main term in Theorem 11.11 Later, we will describe how the secondary term 
appears in their work, but only in the context of irreducible cubic forms (corresponding to cubic 
orders). 

The argument in [7] largely follows Davenport and Heilbronn's original work. Fix a sign, and let 
denote the irreducible points x G Vz with ibDisc(3;) > 0. Write n+ = 6, n_ = 2 for the order of 
the stabilizers of the action of GL2(M) on V^. The equality n-(_ = 3n_ reflects the fact that cubic 
fields of negative discriminant are three times as common as those of positive discriminant. 

Write J- for a certain fundamental domain for GL2(Z)\GL2(M) in GL2(M) (see (12) of |7]), 
originally constructed by Gauss. For any v G V^, consider the multiset J-v, where the multiplicity 
of an element x G is equal to the number oi g (z J- for which gv = x. It is readily checked that 
each element x G Gj\y.^ is represented in this multiset n±/m{x) times, where m(x) denotes the 
size of the stabilizer of x in GL2 (Z) . We conclude that 

1 _ 1 

|Stab(x)| n± 



3.1) N{V^-X):= nTTuTZm =— {^e-F^;nV^± : |Disc(x)|<X}. 



0<±Disc(x)<X 
X irred. 

The reducible points can be handled without too much difficulty: The number of reducible 
integral binary cubic forms au^ + bu^v + cuv'^+dv^ in the multiset IZxiv) := {w G Fv : |Disc(t(;)| < 
X}, satisfying a 7^ 0, is ^ X^/^^''. Conversely, all cubic forms with a = are reducible. This 
condition a 7^ meshes well with the geometric arguments that follow. In addition, the weight 
|g^.^^^,^l occurring in (|6.ip can be neglected, as there are <C X-*^/^ points x with |Disc(x)| < X with 
nontrivial stabilizer. 

The formula in (j6.ip does not depend on v, and one innovation of [7] (previously used by Bhargava 
in [3]) is to average over many v. In particular, define B := {w = (a, b,c,d) £ V : 3a^ + 6^ + + 
3d^ < 10, |Disc(i(;)| > 1}, and we have 

: |Disc(x)|<X}|Disc(^)|-M^; 
LeBnv± \^^sc{v)\-'^dv 

The authors describe this step as "thickening the cusp". As |Disc(t;)|^-'^(if is GL2(M)-invariant, we 
may rewrite this as 

(6.3) N{V^;X) = ^l^^^#{x G V.f^DgB : |Disc(x)| < X}dg, 



^ The counting functions for cubic field discriminants = (mod 7) and = 3 (mod 7) are not the same; the equality 
of the theoretical results above is a coincidence of roundoff error. 
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where := ^ IveBnv^ 10180(^)1 ^dv. This is rewritten as 

(6.4) N{V^,X) = -^[ #{x £V^^nB{n,t,X,X)}t-'^dnd''td''X 

^1 JgeN'{a)A'A 

using a standard decomposition of T ((12) of [7J), where B(n,t, X, X) is the region {x G gB : 
|Disc(x)| < X}. A proposition of Davenport estabhshes that in this situation, the count of lattice 
points above is weh approximated by the volume of B{n,t, X, X), provided that t is not too large. 
Conversely, when t is large all the lattice points are in the cusp a = 0, and hence reducible and not 
counted. Thus, the above is reduced to 

(6.5) NiV^,X) = ^ L^,.(„)^.A Vol(i3(n, t, A, X))r^ dnd'' td>' A + 0(- • • )• 

t<Cl/3Al/3 

The condition on t is now removed, subject to an error term, and this integral is equal to 

(6.6) Ar(y±,x) = -i-^ [ Vol(7^x(^;))|Disc(^;)|-^d^. + 0(- • • )• 
This volume does not in fact depend on v, so that we have (excluding error terms) 

(6.7) N{V^,X) = ±Vol(7ex(^;)) = -^X. 

6.1. Origin of the secondary term. We now explain how Bhargava, Shankar, and Tsimerman 
refine these calculations to obtain a secondary term. Our brief explanation will necessarily be 
somewhat vague, which should encourage the reader to read [7]. We focus on their count of 
irreducible binary cubic forms; as in [21 [36], they also incorporate a sieve to count cubic fields. 

They begin by incorporating several tweaks to (j6.4p . For technical reasons, they count discrim- 
inants in dyadic intervals [X/2,X]. In addition, they introduce a smooth function ^'o(i) on M>o, 
such that ^^{t) = for t < 2 and ^o{t) = 1 for t > 3. They thus rewrite (j6.4p (restricted to a 
dyadic interval [X/2,X]) as 

(6.8) A.(yMA72,Xl, = ^^^^_^^^^ J*„(i5,)+*(i53 

#{x e Vi^^nB{n,t,X, [X/2,X])t-'^dnd''td''X, 

where ^ := 1 — ^'o, and k is a parameter to be chosen later (to minimize error terms). The 
decomposition ^' + = 1 splits this integral into two, and we restrict our attention to the 
part, which yields the secondary term. 

They now "slice" the count of binary cubic forms by the first coordinate. In particular, for 
a G Z, let Ba{n,t,X, [X/2,X]) denote the set of binary cubic forms in Ba{n,t,X, [X/2,X]) whose 
u"^ coefficient is equal to a. Then, we have 

(6.9) #{x G V^^ n B{n, t, A, [X/2, X])} = J] #{x G V^^ D Ba{n, t, A, [X/2, X])} . 

Skipping ahead in their work, the ^'o-contribution to (j6.8p is equal to 

oo . /7,l/3.-\ \10/3„,l/3 

(6-10) ^E/ ^ / M%^)^^^MBu{[X/{2X%X/X']))dXudXX, 



12 



FRANK THORNE 



where Ba{\Y /2^Y]) denotes the set of cubic forms in B with first coordinate a and discriminant in 
[y/2,y]. By Melhn inversion, we have 

oo / 1/3 \ 

(6.11) ''""M^) = ^(1/3) + 3$o(-2)(kM^/^ + 0(- • • ), 

a=l ^ ^ 

and it is this (negative) ^(1/3) which contributes the secondary term. In contrast, the second term 
of ()6.1ip (which "looks larger", especially in light of the eventual choice k = X^^^"^) is reincorporated 
into ()6.10p and then combined with the ^ term of ()6.8p to obtain the main term in ()6.7p . When the 
C(l/3) term is plugged into ()6.10p . the resulting integral can be evaluated without undue difficulty, 
and it has order X^/^ . 

Remark. As Shankar has explained to the author, a more geometric argument can also be given, 
which does not rely on the analytic continuation of the zeta function. However, it is unclear whether 
the resulting error term would be smaller than X^/^ , when combined with the sieve. 

6.2. A correspondence for cubic forms. To count cubic fields one must sieve for maximality, 
using the Davenport-Heilbronn conditions of Proposition 13.21 These could be treated in a naive 
manner, but the authors improve their error terms by introducing a useful correspondence for 
nonmaximal cubic forms 

We introduce some notation. Let N^(yz]X) count all cubic forms v with < ibDisc(z;) < X. 
For a prime let N^{Up;X) count those cubic forms which are nonmaximal at p, i.e., which do 
not lie in the set Up defined in Proposition 13.21 (In referring to cubic forms as "nonmaximal" we 
are implicitly appealing to the Delone-Faddeev correspondence.) Finally, for any a G P^(Z/pZ), 
let N^(Vp^a', X) count those cubic forms v such that the reduction of v modulo p has a as a root. 

With this notation, we have the following correspondence: 

Proposition 6.1. [7J We have the identity 

(6.12) N^{U;;X)= N^{Vp,a;X/p^)- ^ N^{Vp,^;X/p'') + N^{Vz;X/p'). 

This identity generalizes in a straightforward way from prime p to squarefree q, where we count 
cubic forms not in Up for any p\q. The identity amounts to a combinatorial argument, enumerating 
ways in which a nonmaximal cubic ring is contained in a larger ring (with smaller discriminant), 
and then counting these larger rings. 

This identity is crucial in [7J , and it can also be translated into an identity for the (7- nonmaximal 
Shintani zeta function! In work in progress by Bhargava, Taniguchi, and the author [8] we are 
developing this combined approach. We have proved an error term of 0{X^/^^^) in Theorem 1 1.1 1 as 
well as a secondary term for relative cubic extensions of quadratic base fields, and we are currently 
working on further extensions and generalizations. 

7. Equidistribution of Heegner Points 

Remarkably, Hough [23] demonstrated that the Delone-Faddeev correspondence is not the only 
path to the Davenport-Heilbronn theorem. Hough studied the distribution of Heegner points as- 
sociated to 3-torsion elements of ideal class groups of quadratic fields, and he obtained a result 
related to Theorem 11.21 as a consequence. 



As Shankar explained to me, in the case of cubic fields it is this correspondence which allows them to cross the 
X^/'^ barrier. 
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For simplicity, Hough worked only with imaginary discriminants D = 2 (mod 4); note that any 
such discriminant is not fundamental, and the discriminant of the field Q(\/l)) is 4D0 He proves 
that 

(7.1) V #CU-D) = ^X + O(Xi9/20+.)^ 

0<D<X 
D=2 (mod 4) 

and for "good"El test functions (/>, he proves that 

(7.2) *Chi-D)<p{D/X) = + 0{X^/'+^). 

0<D 
D=2 (mod 4) 

The correct secondary term of order X^^^ appears in both formulas, in spite of the larger error 
terms, and in in followup work (in progress) he obtains an error less than X^^^ in ()7.2p . thereby 
obtaining a proof of a related secondary term. 

Hough's work has very different prospects for generalization from the approaches described pre- 
viously. In particular, his approach can be used to study /c-torsion in class groups for odd A; > 3, 
and he makes the following conjecture: 

Conjecture 1 (Hough [23]). For good test functions (f) and odd k > 3, we have 

(7.3) J2 *CU-D)<PiD/X) = ^0(1)X + Ci,k^(^ + i'jxi/^+iA + o(Xi/2+iA), 



0<D 
D=2 (mod 4) 



where 



p-l/k_p-l+2/k_p-l+l/k_p-l 



Indeed he "proves" his conjecture, but with error terms larger than X. Even a proof of the 
main term for the single case k = 5 would be a major achievement. Indeed, the average size of 
Clp(— D) was conjectured by Cohen and Lenstra [IIJ to be 2 for each prime p > 3, and Cohen and 
Lenstra further conjectured that the p-part of Clp{—D) is isomorphic to a fixed p-group H with 
probability proportional to |Aut(jj-)| ■ ^^^^ these conjectures is currently known for any p > 5, 
and essentially nothing is known for p = 3 beyond Theorem 11.21 

7.1. Heegner points and equidistribution. Hough's main result concerns the equidistribution 
of Heegner points associated to the 3-part of the class group. We recall the relevant background. If 
= [xi,X2] is an ideal of an imaginary quadratic field Q{\/—D), where by reordering if necessary 
we have xi/x2 G M := {z £ C : '^{z) > 0}, we call xi/x2 the Heegner point z^ associated to 
the ideal a. A change of basis for a corresponds to a linear fractional transformation Za — )• ff^^ 
with ad — be = 1, and therefore the Heegner point Za is well-defined in the quotient SL2(Z)\]H[. 



-'^^It seems that the restrictions that D = 2 (mod 4) and that D be negative could both be hfted with some 
effort. The restriction on the sign is the more serious of the two, as elements of class groups of Q(V^) for positive 
D correspond to closed geodesies rather than Heegner points. However, Duke's result holds for either sign, so in [23] 
Hough expresses optimism that the positive discriminant case of Theorem [T3] could be handled as well. 
Compact support and infinitely difi'erentiable. 
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One checks that Za = Zaa for any a G Q{\/—D), and hence depends only on the class of a in 
Cl{\^—D). One similarly checks that Za / Zf, if o and b are inequivalent in Cl{\/—D). 

Therefore, there are h{—D) Heegner points associated to Q{\/—D) in SL2(Z)\H, and in |17] 
Duke proves that they equidistribute with respect to the hyperbolic measure "^^^f^ as D — )• oo. 
Hough proves the same for the Heegner points associated to the 3-part of the ideal class group: 

Theorem 7.1 (Hough [23J). Let B be a bounded Borel measurable subset o/H having boundary of 
measure zero. Then as D ^ oo, 

(7.5) ^ #{a primitive, nonprincipal : [a] G CZ3(Q(\/^^)),Za G ^vol{B). 

o>-D>~x ^ 

d=2 (mod 4) 
squarefree 

Here we take i? C H rather than B C SL2(Z)\]H[, so that each Heegner point is counted once for 
each representative of its SL2(Z) orbit in B. (We recall that a primitive ideal is one without any 
rational integer divisors other than 1.) 

Straightaway one can see that this is an equidistribution statement. But it also implies the 
Davenport-Heilbronn theorem! Taking B to be the standard (or any other) fundamental domain 
for the action of SL2(Z) on H, one counts each Heegner point (and thus each nontrivial element of 
C\3{Q{\/ —D))) exactly once. The Davenport-Heilbronn theorem follows from the classical compu- 
tation vol(i?) = f . 

We will follow Hough's approach sufficiently far as to see the secondary term. His work begins 
with the following parameterization of ideals (and therefore Heegner points), essentially due to 
Soundararajan [35j . 

Proposition 7.2. |23l [35] Let D = 2 (mod 4) be squarefree and let k > 3 be odd. The set 

(7.6) {{l,m,n,t) £ (Z-^)^ : Im^ = fn^ + t^D, l\D, {m,ntD) = l} 

is in bisection with primitive ideal pairs (o, a) with a 7^ 1 and 0*^ principal in Q{\^—D). Explicitly, 
the ideal a is given as a Z-module by 

a = [lm,lnt~^ + 

where Na = Im and t^^ is the inverse oft modulo m. 

The proof is relatively straightforward. For example, if a'^ is principal and coprime to D, then 
al' = {n + tyJ—D) and taking norms yields (Na)^ = + t^D, a solution to (j7.6p . Ideals a not 
coprime to D yield solutions to (j7.6p with / > 1. 

As one of the two main steps in his proof. Hough now applies this parameterization to count 
Heegner points in the region 

(7.7) i2y := {z G M : -i < 5R(z) < 1, 3(z) > 1}. 

(This establishes the vertical distribution of Heegner points, and he separately establishes that they 
are equidistributed horizontally.) He proves that the number of Heegner points in Ry is equal to 

(7.8) l_YX + C,/,X'/' + 0{---), 

for a fairly complicated error term which depends on both X and Y. The secondary term appears 
here for the following reason: By the parameterization above, we may write an ideal a as [Im, Inf^^ 
\/—D]. We have Im^ = Pn^ + t^D, so that Im > D^^^, implying that the Heegner point Za has 
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imaginary part < D^/^ < X^/^ . In other words, the Heegner points equidistribute as X — t- oo, 
but for fixed X they do not appear high in the cusp. 

The hyperbohc volume of the subset of Ry with "^{z) > X^l'° is equal to 

(7.9) r r 

Jx=-l/2 Jy=Xi/6 r 

Accounting (much more carefully than done here!) for the fact that this subset contains no Heegner 
points yields the secondary negative term in (j7.8p . and an analogous argument explains the second 
term in (17.31). 



Remark. Hough's methods also extend to arithmetic progressions, where he finds the same lack of 
equidistribution observed in Section 15.11 some results along these lines are in preparation. 

8. HiRZEBRucH Surfaces and the Maroni Invariant 

Zhao's work [44] is still in preparation, so we offer only a very brief overview here. Using algebraic 
geometry, Zhao estimates the number of cubic extensions of the rational function field Fg(t). 

Fix a finite field of characteristic not equal to 2 or 3, and let S{2N) be the number of 
isomorphism classes of cubic extensions K/¥g(t) of degree 3, such that |Disc(-F)| = g^^. In pTi] , 
Datskovsky and Wright proved that 

(8.1) Si2N) = 2^^^^^^^,^- + oi,^% 

CF,(t)l3) 

where CFq{t)(s) is the Dedekind zeta function of ¥q{t), or equivalently the zeta function of the 
algebraic curve Pj- . 

Datskovsky and Wright's proof uses the adelic Shintani zeta function; in |44j Zhao obtains another 
proof of (18. ip using algebraic geometry. A trigonal curve is a 3- fold cover of P^, and there is a 
bijection between isomorphism classes of smooth trigonal curves and cubic rational function fields. 
Already this perspective is enlightening; for example, the Riemann-Hurwitz formula implies that 
the discriminant has even degree, explaining why (18. ip is an equation for S{2N) and not S{N). 

The problem, then, is to count smooth trigonal curves. These may be counted by embedding the 
curves into Hirzebruch surfaces F^; for each trigonal curve C there is a unique nonnegative integer 
k, called the Maroni invariant of C, such that C embeds into = Ppi(0 ® 0(—k)). 

Zhao now counts curves on each surface -F^ and sieves for smoothness, using a sieve method 
related to one developed by Poonen |30] . However, the main technique is different and new. The 
main term in (j8.ip comes from estimating the number of trigonal curves C with each Maroni 
invariant k, and then summing the results over all integers k. However, the Maroni invariant of a 
smooth trigonal curve C is at most A^/3, provided that the field corresponding to C has absolute 
discriminant g^^. Limiting our sum to k < N/3 naturally introduces a secondary term of order 

Zhao's method, like all other methods described in this paper, comes with error terms; at present 
he has obtained an error term of 0(g^^/^) in He has some optimism that this error term can 

be reduced below q^^^^, thereby proving the secondary term. 

9. Conclusion 

Cubic fields have seen a great deal of recent study recently, and we recommend the papers of 
Cohen-Morra [1_2J and Martin-Pollack |25] (among others) for recent studies of cubic fields from 
other perspectives. 
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Many interesting open questions remain in the subject. For example, what is the "correct" error 
term in the Davenport-Heilbronn theorems? Even if we cannot prove it, it would still be interesting 
to determine the expected order of magnitude. To get the best error terms it seems that we should 
count fields K of degree at most 3, each weighted by | Aut(_R") | • '^^^ data in [31] suggests that the 
true error may be smaller than X^/^, and a comparison with the divisor problem suggests that the 
error might be on the order of X'^/^ . Nevertheless, the comparison with the divisor problem is not 
exact, and the numerical data is inconclusive. 

Perhaps the most compelling open question concerns quartic and quintic fields. Asmyptotic for- 
mulas were proved by Bhargava ^ [6j ; should these formulas have secondary terms? It is generally 
believed that they should; however, so far we lack even a good conjecture. The zeta function ap- 
proach seems likely to work, and Yukie [33] has made some progress in this direction; the geometric 
approach seems likely to work as well. However, for now the difficulties with either approach appear 
rather severe. 

Another open question concerns the multiplicity of cubic fields. It is believed that there should 
be <C cubic fields of discriminant in, but the best bound known is 0(n^/^+'^), due to Ellenberg 
and Venkatesh [19j. Nontrivial bounds were also obtained by Helfgott and Venkatesh [22j and 
Pierce [29], using a variety of methods. Any of the methods described in the present paper could 
potentially yield improvements, but none of them have succeeded to date. (The present author 
has attempted improvements by means of Shintani zeta functions, which have led to a number of 
instructve and interesting failures.) 

Finally, this paper begs the question of whether unexpected connections might be found between 
the four perspectives presented here, and some promising preliminary results have been obtained 
in this direction. We anticipate that this and related questions will be addressed in the near future. 
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